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POLYNUCLEOTIDES ENCODING HUMAN PROTEASE HOMOLOGS 

The present application claims the benefit of U.S. 
5 Provisional Application Number 60/171,566 which was filed on 
December 22, 1999 and is herein incorporated by reference in 
its entirety. 

1 . INTRODUCTION 
The present invention relates to the discovery, 

10 identification, and characterization of novel human 

polynucleotides encoding proteins sharing sequence similarity 
with mammalian proteases. The invention encompasses the 
described polynucleotides, host cell expression systems, the 
encoded protein, fusion proteins, polypeptides and peptides, 

15 antibodies to the encoded proteins and peptides, and 

genetically engineered animals that either lack or over 
express the disclosed sequences, antagonists and agonists of 
the proteins, and other compounds that modulate the expression 
or activity of the proteins encoded by the disclosed 

20 polynucleotides that can be used for diagnosis, drug 

screening, clinical trial monitoring and the treatment of 
physiological disorders . 

2 . BACKGROUND OF THE INVENTION 
25 Proteases cleave protein substrates as part of 

degradation, maturation, and secretory pathways within the 
body. Proteases have been associated with, inter alia, 
regulating development, modulating cellular processes, 
fertility, and infectious disease. 

30 

3 . SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
identification, and characterization of nucleotides that 
encode novel human proteins, and the corresponding amino acid 
35 sequences of these proteins. The novel human proteins (NHPs) 
described for the first time herein share structural 
similarity with animal proteases, and particularly trypsin- 
like proteases such as oviductin . 
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The novel human nucleic acid (cDNA) sequences described 
herein, encode a proteins /open reading frames (ORFs) of 306, 
302, and 164 amino acids in length (see SEQ ID NOS : 2, 4, and 
6 respectively) . 

5 The invention also encompasses agonists and antagonists 

of the described NHPs, including small molecules, large 
molecules, mutant NHPs, or portions thereof that compete with 
native NHPs, NHP peptides, and NHP antibodies, as well as 
nucleotide sequences that can be used to inhibit the 

10 expression of the described NHPs (e.g., antisense and ribozyme 
molecules, and gene or regulatory sequence replacement 
constructs) or to enhance the expression of the described NHPs 
(e.g., expression constructs that place the described sequence 
under the control of a strong promoter system) , and transgenic 

15 animals that express a NHP transgene, or "knock-outs" (which 
can be conditional) that do not express a functional NHP . 

Further, the present invention also relates to processes 

i 

for identifying compounds that modulate, i.e., act as agonists 
or antagonists, of NHP expression and/or NHP activity that 

20 utilize purified preparations of the described NHP and/or NHP 
product, or cells expressing the same. Such compounds can be 
used as therapeutic agents for the treatment of any of a wide 
variety of symptoms associated with biological disorders or 
imbalances . 

25 - 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of the NHP 
ORFs encoding the described NHP amino acid sequences. SEQ ID 
NO: 7 describes an NHP ORF with flanking sequences. 

30 

5. DETAILED DESCRIPTION OF THE INVENTION 
The NHPs, described for the first time herein, are novel 
proteins that are expressed in, inter alia, human cell lines, 
and human thymus, trachea, kidney, prostate, testis, thyroid, 
35 salivary gland, stomach, placenta, mammary gland, adipose, 

skin, esophagus, bladder, pericardium, and fetal kidney cells. 
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The described sequences were compiled from gene trapped 
cDNAs and clones isolated from a human kidney cDNA library 
(Edge Biosystems, Gai thersburg , MD) . The present invention 
encompasses the nucleotides presented in the Sequence Listing, 
5 host cells expressing such nucleotides, the expression 

products of such nucleotides, and: (a) nucleotides that encode 
mammalian homologs of the described sequences, including the 
specifically described NHPs, and the NHP products; 
(b) nucleotides that encode one or more portions of a NHP that 

10 correspond to functional domains of the NHP , and the 

polypeptide products specified by such nucleotide sequences, 
including but not limited to the novel regions of any active 
domain (s) ; (c) isolated nucleotides that encode mutant 
versions, engineered or naturally occurring, of a described 

15 NHP in which all or a part of at least one domain is deleted 
or altered, and the polypeptide products specified by such 
nucleotide sequences, including but not limited to soluble 
proteins and peptides in which all or a portion of the signal 
sequence is deleted; (d) nucleotides that encode chimeric 

20 fusion proteins containing all or a portion of a coding region 
of a NHP, or one of its domains {e.g., a receptor or ligand 
binding domain, accessory protein/ self -association domain, 
etc.) fused to another peptide or polypeptide; or (e) 
therapeutic or diagnostic derivatives of the described 

25 polynucleotides such as oligonucleotides, antisense 

polynucleotides, ribozymes, dsKNA, or gene therapy constructs 
comprising a sequence first disclosed in the Sequence Listing. 

As discussed above, the present invention includes: 
(a) the human DNA sequences presented in the Sequence Listing 

3 0 (and vectors comprising the same) and additionally 

contemplates any nucleotide sequence encoding a contiguous NHP 
open reading frame (ORF) , or a contiguous exon splice junction 
first described in the Sequence Listing, that hybridizes to a 
complement of a DNA sequence presented in the Sequence Listing 

35 under highly stringent conditions, e.g., hybridization to 
filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate 
(SDS) , 1 mM EDTA at 65 °C, and washing in 0.1xSSC/0.1% SDS at 
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68°C (Ausubel F.M. et al . , eds . , 1989, Current Protocols in 
Molecular Biology, Vol. I, Green Publishing Associates, Inc., 
and John Wiley & sons, Inc., New York, at p. 2.10.3) and 
encodes a functionally equivalent gene product. Additionally 
5 contemplated are any nucleotide sequences that hybridize to 
the complement of the DNA sequence that encode and express an 
amino acid sequence presented in the Sequence Listing under 
moderately stringent conditions, e.g., washing in 0.2xSSC/0.1% 
SDS at 42°C (Ausubel et al . , 1989, supra), yet still encode a 

10 functionally equivalent NHP product. Functional equivalents 
of a NHP include naturally occurring NHPs present in other 
species and mutant NHPs whether naturally occurring or 
engineered (by site directed mutagenesis, gene shuffling, 
directed evolution as described in, for example, U.S. Patent 

15 No. 5,837,458). The invention also includes degenerate 

nucleic acid variants of the disclosed NHP polynucleotide 
sequences . 

Additionally contemplated are polynucleotides encoding a 
NHP ORF , or its functional equivalent, encoded by a 

20 polynucleotide sequence that is about 99/ 95, 90, or about 85 
percent similar or identical to corresponding regions of the 
nucleotide sequences of the Sequence Listing (as measured by 
BLAST sequence comparison analysis using, for example, the GCG 
sequence analysis package using standard default settings). 

25 The invention also includes nucleic acid molecules, 

preferably DNA molecules, that hybridize to, and are therefore 
the complements of, the described NHP nucleotide sequences. 
Such hybridization conditions may be highly stringent or less 
highly stringent, as described above. In instances where the 

30 nucleic acid molecules are deoxyoligonucleotides ("DNA 

oligos" ), such molecules are generally about 16 to about 100 
bases long, or about 20 to about 80, or about 34 to about 45 
bases long, or any variation or combination of sizes 
represented therein that incorporate a contiguous region of 

35 sequence first disclosed in the Sequence Listing. Such 
oligonucleotides can be used in conjunction with the 
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polymerase chain reaction (PCR) to screen libraries, isolate 
clones, and prepare cloning and sequencing templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing 
5 gene expression patterns (particularly using a micro array or 
high- throughput "chip" format) . Additionally, a series of the 
described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 

10 sequence first disclosed in at least a portion of one or more 
of the sequences of SEQ ID NOS : 1-7 can be used as a 
hybridization probe in conjunction with a solid support 
matrix/substrate (resins, beads, membranes, plastics, 
polymers, metal or metallized substrates, crystalline or 

15 polycrystalline substrates, etc.). Of particular note are 
spatially addressable arrays (i.e., gene chips, microtiter 
plates, etc.) of oligonucleotides and polynucleotides, or 
corresponding oligopeptides and polypeptides, wherein at least 
one of the biopolymers present on the spatially addressable 

20 array comprises an oligonucleotide or polynucleotide sequence 
first disclosed in at least one of the sequences of SEQ ID 
NOS: 1-7, or an amino acid sequence encoded thereby. Methods 
for attaching biopolymers to, or synthesizing biopolymers on, 
solid support matrices, and conducting binding studies thereon 

25 are disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 
5,556,752, 5,744,305, 4,631,211, 5,445,934, 5,252,743, 
4,713,326, 5,424,186, and 4,689,405 the disclosures of which 
are herein incorporated by reference in their entirety. 

Addressable arrays comprising sequences first disclosed 

30 in SEQ ID NOS:l-7"can be used to identify and characterize 
the temporal and tissue specific expression of a sequence. 
These addressable arrays incorporate oligonucleotide sequences 
of sufficient length to confer the required specificity, yet 
be within the limitations of the production technology. The 

35 length of these probes is within a range of between about 8 to 
about 2000 nucleotides. Preferably the probes consist of 60 



5 



WO 01/46407 • PCT/US00/33738 

nucleotides and more preferably 2 5 nucleotides from the 
sequences first disclosed in SEQ ID NOS:l-7. 

For example, a series of the described oligonucleotide 
sequences, or the complements thereof, can be used in chip 
5 format to represent all or a portion of the described 

sequences. The oligonucleotides, typically between about 16 
to about 40 (or any whole number within the stated range) 
nucleotides in length can partially overlap each other and/or 
the sequence may be represented using oligonucleotides that do 

10 not overlap. Accordingly, the described polynucleotide 

sequences shall typically comprise at least about two or three 
distinct oligonucleotide sequences of at least about 8 
nucleotides in length that are each first disclosed in the 
described Sequence Listing. Such oligonucleotide sequences 

15 can begin at any nucleotide present within a sequence in the 
Sequence Listing and proceed in either a sense (5'-to-3') 
orientation vis-a-vis the described sequence or in an 
antisense orientation. 

Microarray-based analysis allows the discovery of broad 

20 patterns of genetic activity, providing new understanding of 
gene functions and generating novel and unexpected insight 
into transcriptional processes and biological mechanisms. The 
use of addressable arrays comprising sequences first disclosed 
in SEQ ID NOS:l-7 provides detailed information about 

2 5 transcriptional changes involved in a specific pathway, 

potentially leading to the identification of novel components 
or gene functions that manifest themselves as novel 
phenotypes . 

Probes consisting of sequences first disclosed in SEQ ID 
30 NOS:l-7 can also be used in the identification, selection and 
validation of novel molecular targets for drug discovery. The 
use of these unique sequences permits the direct confirmation 
of drug targets and recognition of drug dependent changes in 
gene expression that are modulated through pathways distinct 

3 5 from the drugs intended target. These unique sequences 

therefore also have utility in defining and monitoring both 
drug action and toxicity. 
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As an example of utility, the sequences first disclosed 
in SEQ ID N0S:l-7 can be utilized in microarrays or other 
assay formats, to screen collections of genetic material from 
patients who have a particular medical condition. These 
5 investigations can also be carried out using the sequences 
first disclosed in SEQ ID NOS:l-7 in silico and by comparing 
previously collected genetic databases and the disclosed 
sequences using computer software known to those in the art. 

Thus the sequences first disclosed in SEQ ID NOS:l-7 can 

10 be used to identify mutations associated with a particular 
disease and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
specifically described using nucleotide sequence, it should be 
appreciated that each of the sequences can uniquely be 

15 described using any of a wide variety of additional structural 
attributes, or combinations thereof. For example, a given 
sequence can be described by the net composition of the 
nucleotides present within a given region of the sequence in 
conjunction with the presence of one or more specific 

20 oligonucleotide sequence (s) first disclosed in the SEQ ID NOS : 
1-7. Alternatively, a restriction map specifying the relative 
positions of restriction endonuclease digestion sites, or 
various palindromic or other specific oligonucleotide 
sequences can be used to structurally describe a given 

25 sequence. Such restriction maps, which are typically 

generated by widely available computer programs (e.g., the 
University of Wisconsin GCG sequence analysis package, 
SEQUENCHER 3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can 
optionally be used in conjunction with one or more discrete 

30 nucleotide sequence (s) present in the sequence that can be 

described by the relative position of the sequence relatve to 
one or more additional sequence (s) or one or more restriction 
sites present in the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions 

35 may refer, e.g., to washing in 6xSSC/0.05% sodium 

pyrophosphate at 37°C {for 14-base oligos), 48°C (for 17-base 
oligos) , 55°C (for 20-base oligos), and 60 °C (for 23-base 
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oligos) . These nucleic acid molecules may encode or act as 
NHP sequence antisense molecules, useful, for example, in NHP 
gene regulation (for and/or as antisense primers in 
amplification reactions of NHP gene nucleic acid sequences) . 
5 With respect to NHP gene regulation, such techniques can be 
used to regulate biological functions. Further, such 
sequences may be used as part of ribozyme and/or triple helix 
sequences that are also useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides 
10 can additionally comprise at least one modified base moiety 
which is selected from the group including but not limited to 
5-f luorouracil , 5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxanthine , xantine , 4-acetylcytosine, 

5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl - 

15 2-thiouridine , 5-carboxymethylaminomethyluracil , 

dihydrouracil , beta-D-galactosylqueosine , inosine, 
N6-isopentenyladenine, 1-methylguanine , 1-methylinosine , 
2, 2-dimethylguanine, 2-methyladenine , 2 -methylguanine , 
3 -methyl cy t o s ine , 5 -methyl cy t o s ine , N6 - adenine , 

20 7 -methylguanine , 5 -methylaminomethyluraci 1 , 

5-methoxyaminomethyl--2-thiouracil , beta-D-mannosylqueosine , 
5'-methoxycarboxymethyluracil , 5-methoxyuracil , 
2 -methyl thio-N6-isopentenyladenine , uracil- 5 -oxyacetic acid 
(v) , wybutoxosine, pseudouracil , queosine,- 2- thiocytosine , 

25 5-methyl-2-thiouracil, 2-thiouracil , 4- thiouracil , 

5-methyluracil, uracil -5- oxyacetic acid methylester, uracil- 
5-oxyacetic acid (v) , 5-methyl-2- thiouracil , 3- ( 3 -amino-3-N-2- 
carboxypropyl ) uracil, (acp3)w, and 2 , 6-diaminopurine . 

The antisense oligonucleotide can also comprise at least 

3 0 one modified sugar moiety selected from the group including 

but not limited to arabinose, 2 -f luoroarabinose , xylulose, and 
hexose . 

In yet another embodiment, the antisense oligonucleotide 
will comprise at least one modified phosphate backbone 
3 5 selected from the group consisting of a phosphorothioate , a 

phosphorodithioate, a phosphoramidothioate , a phosphoramidate , 
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a phosphordiamidate, a methylphosphonate , an alkyl 
phosphotriester , and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide 
is an a-anomeric oligonucleotide. An a-anomeric 
5 oligonucleotide forms specific double-stranded hybrids with 

complementary RNA in which, contrary to the usual p -units, the 
strands run parallel to each other (Gautier et al . , 1987, 
Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2 '-0- 
methylribonucleotide (Inoue et al . , 1987, Nucl. Acids Res. 
10 15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al . , 

1987, FEBS Lett. 215:327-330). Alternatively, double stranded 
RNA can be used to disrupt the expression and function of a 
targeted NHP . 

Oligonucleotides of the invention can be synthesized by 

15 standard methods known in the art, e.g. by use of an automated 
DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides can be synthesized by the 
method of Stein et al . (1988, Nucl. Acids Res. 16:3209), and 

20 methylphosphonate oligonucleotides can be prepared by use of 
controlled pore glass polymer supports (Sarin et al . , 1988, 
Proc. Natl. Acad. Sci. U.S.A. 85 : 7448-7451 ) , etc. 

Low stringency conditions are well known to those of 
skill in the art, and will vary predictably depending on the 

25 specific organisms from which the library and the labeled 

sequences are derived. For guidance regarding such conditions 
see, for example, Sambrook et al . , 1989, Molecular Cloning, A 
Laboratory Manual (and periodic updates thereof), Cold Springs 
Harbor Press, N.Y.; and Ausubel et al . , 1989, Current 

30 Protocols in Molecular Biology, Green Publishing Associates 
and Wiley Interscience , N.Y. 

Alternatively, suitably labeled NHP nucleotide probes can 
be used to screen a human genomic library using appropriately 
stringent conditions or by PCR. The identification and 

35 characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
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polymorphisms, or coding single nucleotide polymorphisms), 
determining the genomic structure of a given locus /allele , and 
designing diagnostic tests. For example, sequences derived 
from regions adjacent to the intron/exon boundaries of the 
5 human gene can be used to design primers for use in 

amplification assays to detect mutations within the exons, 
introns, splice sites (e.g., splice acceptor and/or donor 
sites), etc., that can be used in diagnostics and 
pharmacogenomics . 

10 Further, a NHP homolog can be isolated from nucleic acid 

from an organism of interest by performing PCR using two 
degenerate or "wobble" oligonucleotide primer pools designed 
on the basis of amino acid sequences within the NHP products 
disclosed herein. The template for the reaction may be total 

15 RNA, mRNA, and/or cDNA obtained by reverse transcription of 
mRNA prepared from human or non-human cell lines or tissue 
known or suspected to express an allele of a NHP gene. 

The PCR product can be subcloned and sequenced to ensure 
that the amplified sequences represent the sequence of the 

20 desired NHP. The PCR fragment can then be used to isolate a 
full length cDNA clone by a variety of methods. For example, 
the amplified fragment can be labeled and used to screen a 
cDNA library, such as a bacteriophage cDNA library. 
Alternatively, the labeled fragment can be used to isolate 

2 5 genomic clones via the screening of a genomic library. 

PCR technology can also be used to isolate full length 
cDNA sequences. For example, RNA can be isolated, following 
standard procedures, from an appropriate cellular or tissue 
source (i.e., one known, or suspected, to express a NHP gene, 
30 such as, for example, testis tissue). A reverse transcription 
(RT) reaction can be performed on the RNA using an 
oligonucleotide primer specific for the most 5' end of the 
amplified fragment for the priming of first strand synthesis. 
The resulting RNA/DNA hybrid may then be "tailed" using a 

3 5 standard terminal transferase reaction, the hybrid may be 

digested with RNase H, and second strand synthesis may then be 
primed with a complementary primer. Thus, cDNA sequences 

10 
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upstream of the amplified fragment can be isolated. For a 
review of cloning strategies that can be used, see e.g., 
Sambrook et al . , 1989, supra. 

A cDNA encoding a mutant NHP gene can be isolated, for 
5 example, by using PCR. In this case, the first cDNA strand 
may be synthesized by hybridizing an oligo-dT oligonucleotide 
to mRKfA isolated from tissue known or suspected to be 
expressed in an individual putatively carrying a mutant NHP 
allele, and by extending the new strand with reverse 

10 transcriptase. The second strand of the cDNA is then 
synthesized using an oligonucleotide that hybridizes 
specifically to the 5' end of the normal gene. Using these 
two primers, the product is then amplified via PCR, optionally 
cloned into a suitable vector, and subjected to DNA sequence 

15 analysis through methods well known to those of skill in the 
art. By comparing the DNA sequence of the mutant NHP allele 
to that of a corresponding normal NHP allele, the mutation (s) 
responsible for the loss or alteration of function of the 
mutant NHP gene product can be ascertained. 

20 Alternatively, a genomic library can be constructed using 

DNA obtained from an individual suspected of or known to carry 
a mutant NHP allele (e.g., a person manifesting a NHP- 
associated phenotype such as, for example, obesity, high blood 
pressure, connective tissue disorders, infertility, etc.), or 

25 a cDNA library can be constructed using RNA from a tissue 

known, or suspected, to express a mutant NHP allele. A normal 
NHP gene, or any suitable fragment thereof, can then be 
labeled and used as a probe to identify the corresponding 
mutant NHP allele in such libraries. Clones containing mutant 

30 NHP gene sequences can then be purified and subjected to 

sequence analysis according to methods well known to those 

skilled in the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated 
35 from a tissue known, or suspected, to express a mutant NHP 

allele in an individual suspected of or known to carry such a 
mutant allele. In this manner, gene products made by the 

11 
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putatively mutant tissue can be expressed and screened using 
standard antibody screening techniques in conjunction with 
antibodies raised against normal NHP product, as described 
below. (For screening techniques, see, for example, Harlow, 
5 E. and Lane, eds . , 1988, "Antibodies: A Laboratory Manual", 
Cold Spring Harbor Press, Cold Spring Harbor.) 

Additionally, screening can be accomplished by screening with 
labeled NHP fusion proteins, such as, for example, alkaline 
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins. 

10 In cases where a NHP mutation results in an expressed gene 

product with altered function [e.g., as a result of a missense 
or a frameshift mutation) , polyclonal antibodies to NHP are 
likely to cross-react with a corresponding mutant NHP gene 
product. Library clones detected via their reaction with such 

15 labeled antibodies can be purified and subjected to sequence 
analysis according to methods well known in the art. 

The invention also encompasses (a) DNA vectors that 
contain any of the foregoing NHP coding sequences and/or their 
complements (i.e., antisense) ; (b) DNA expression vectors that 

20 contain any of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the 
expression of the coding sequences (for example, baculo virus 
as described in U.S. Patent No. 5,869,336 herein incorporated 
by reference) ; (c) genetically engineered host cells that 

2 5 contain any of the foregoing NHP coding sequences operatively 

associated with a regulatory element that directs the 
expression of the coding sequences in the host cell; and (d) 
genetically engineered host cells that express an endogenous 
NHP sequence under the control of an exogenously introduced 
30 regulatory element (i.e., gene activation). As used herein, 

regulatory elements include, but are not limited to, inducible 
and non- inducible promoters, enhancers, operators and other 
elements known to those skilled in the art that drive and 
regulate expression. Such regulatory elements include but are 

3 5 not limited to the human cytomegalovirus (hCMV) immediate 

early gene, regulatable, viral elements (particularly 
retroviral LTR promoters) , the early or late promoters of SV4 0 

12 
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adenovirus, the lac system, the trp system, the TAC system, 
the TRC system, the major operator and promoter regions of 
phage lambda, the control regions of fd coat protein, the 
promoter for 3 -phosphoglycerate kinase (PGK) , the promoters of 
5 acid phosphatase, and the promoters of the yeast a-mating 
factors . 

The present invention also encompasses antibodies and 
anti-idiotypic antibodies (including Fab fragments), 
antagonists and agonists of a NHP, as well as compounds or 

10 nucleotide constructs that inhibit expression of a NHP gene 
(transcription factor inhibitors, antisense and ribozyme 
molecules, or gene or regulatory sequence replacement 
constructs), or promote the expression of a NHP {e.g., 
expression constructs in which NHP coding sequences are 

15 operatively associated with expression control elements such 
as promoters, promo ter/ enhancers , etc. ) . 

The NHPs or NHP peptides, NHP fusion proteins, NHP 
nucleotide sequences, antibodies, antagonists and agonists can 
be useful for the detection of mutant NHPs or inappropriately 

20 expressed NHPs for the diagnosis of disease. The NHP proteins 
or peptides, NHP fusion proteins, NHP nucleotide sequences, 
host cell expression systems, antibodies, antagonists, 
agonists and genetically engineered cells and animals can be 
used for screening for drugs (or high throughput screening of 

25 combinatorial libraries) effective in the treatment of the 
symptomatic or phenotypic manifestations of. perturbing the 
normal function of a NHP in the body. The use of engineered 
host cells and/or animals may offer an advantage in that such 
systems allow not only for the identification of compounds 

30 that bind to the endogenous receptor for a NHP, but can also 
identify compounds that trigger NHP-mediated activities or 
pathways . 

Finally, the NHP products can be used as therapeutics. 
For example, soluble derivatives such as NHP peptides /domains 
35 corresponding to NHP, NHP fusion protein products (especially 
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of 
a NHP, to an IgFc) , NHP antibodies and anti-idiotypic 
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antibodies (including Fab fragments) , antagonists or agonists 
(including compounds that modulate or act on downstream 
targets in a NHP-mediated pathway) can be used to directly 
treat diseases or disorders. For instance, the administration 
5 of an effective amount of soluble NHP, or a NHP-IgFc fusion 

protein or an anti- idiotypic antibody (or its Fab) that mimics 
the NHP could activate or effectively antagonize the 
endogenous NHP receptor. Nucleotide constructs encoding such 
NHP products can be used to genetically engineer host cells to 

10 express such products in vivo; these genetically engineered 
cells function as ""bioreactors" in the body delivering a 
continuous supply of a NHP, a NHP peptide, or a NHP fusion 
protein to the body. Nucleotide constructs encoding 
functional NHP, mutant NHPs, as well as antisense and ribozyme 

15 molecules can also be used in "gene therapy" approaches for 
the modulation of NHP expression. Thus, the invention also 
encompasses pharmaceutical formulations and methods for 
treating biological disorders. 

Various aspects of the invention are described in greater 

20 detail in the subsections below. 

5 . 1 THE NHP SEQUENCES 
The cDNA sequences (SEQ ID NO : 1, 3, and 5) and the 
corresponding deduced amino acid sequences of the described 

2 5 NHP are presented in the Sequence Listing. SEQ ID NO : 7 

describes a NHP ORF as well as flanking regions. The NHP 
nucleotides were obtained from human cDNA libraries using 
probes and/or primers generated from human gene trapped 
sequence tags. Expression analysis has provided evidence that 

30 the described NHP can be expressed a variety of human cells as 
well as gene trapped human cells. In addition, the described 
NHP sequences can contain a variety of polymorphisms such as 
at nucleotide 68 of SEQ ID NO : 1 and nucleotide 56 of SEQ ID 
NO: 3 which both can be a G or an A that can give rise to 

35 corresponding arg or gin at amino acid position 23 of SEQ ID 
NO: 2, or residue 19 of SEQ ID NO : 4 . The described NHP 
sequences can also contain A-G polymorphisms at nucleotide 82 
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of SEQ ID NO:l and nucleotide 7 0 of SEQ ID NO : 3 which can give 
rise to a corresponding ala or thr at amino acid position 28 
of SEQ ID NO: 2, or residue 24 of SEQ ID NO : 4 . The described 
NHPs share similarity with trypsin-like proteases, plasminogen 
5. activators, and human plasma kallikrein precursor. 

5.2 NHPs AND NHP POLYPEPTIDES 
NHPs, polypeptides, peptide fragments, mutated, 
truncated, or deleted forms of the NHPs, and/or NHP fusion 
10 proteins can be prepared for a variety of uses. These uses 

include, but are not limited to, the generation of antibodies, 
as reagents in diagnostic assays, for the identification of 
other cellular gene products related to a NHP, as reagents in 
assays for screening for compounds that can be as 
15 pharmaceutical reagents useful in the therapeutic treatment of 
mental, biological, or medical disorders and disease. 

The Sequence Listing discloses the amino acid sequence 
encoded by the described NHP polynucleotides. The NHPs 
display initiator methionines in DNA sequence contexts 
20 consistent with a translation initiation site, and display a 

consensus signal sequence. 

The NHP amino acid sequences of the invention include the 
amino acid sequences presented in the Sequence Listing as well 
as analogues and derivatives thereof, as well as any 

25 oligopeptide sequence of at least about 10-40, generally about 
12-35, or about 16-30 amino acids in length first disclosed in 
the Sequence Listing. Further, corresponding NHP homologues 
from other species are encompassed by the invention. In fact, 
any NHP encoded by the NHP nucleotide sequences described 

30 above are within the scope of the invention, as are any novel 
polynucleotide sequences encoding all or any novel portion of 
an amino acid sequence presented in the Sequence Listing. The 
degenerate nature of the genetic code is well known, and, 
accordingly, each amino acid presented in the Sequence 

35 Listing, is generically representative of the well known 

nucleic acid "triplet" codon, or in many cases codons, that 
can encode the amino acid. As such, as contemplated herein, 



15 



WO 01/46407 f PCI7US00/33738 

the amino acid sequences presented in the Sequence Listing, 
when taken together with the genetic code (see, for example, 
Table 4-1 at page 109 of "Molecular Cell Biology", 1986, J. 
Darnell et al . eds . , Scientific American Books, New York, NY, 
5 herein incorporated by reference) are generically 
representative of all the various permutations and 
combinations of nucleic acid sequences that can encode such 
amino acid sequences. 

The invention also encompasses proteins that are 

10 functionally equivalent to the NHPs encoded by the presently 
described nucleotide sequences as judged by any of a number of 
criteria, including, but not limited to, the ability to bind 
and cleave a substrate of a NHP, or the ability to effect an 
identical or complementary downstream pathway, or a change in 

15 cellular metabolism (e.g., proteolytic activity, ion flux, 

tyrosine phosphorylation, etc.). Such functionally equivalent 
NHP proteins include, but are not limited to, additions or 
substitutions of amino acid residues within the amino acid 
sequence encoded by the NHP nucleotide sequences described 

2 0 above, but which result in a silent change, thus producing a 

functionally equivalent gene product. Amino acid 
substitutions can be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity , hydrophilici ty , 
and/or the amphipathic nature of the residues involved. For 

25 example, nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glutamine; positively charged (basic) amino acids include 

30 arginine, lysine, and histidine; and negatively charged 

(acidic) amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used 
to express the NHP nucleotide sequences of the invention. 
Where, as in the present instance, the NHP products or NHP 

3 5 polypeptides are thought to be soluble or secreted molecules, 

the peptide or polypeptide can be recovered from the culture 
media. Such expression systems also encompass engineered host 
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cells that express a NHP, or a functional equivalent, in situ. 
Purification or enrichment of NHP from such expression systems 
can be accomplished using appropriate detergents and lipid 
micelles and methods well known to those skilled in the art. 
5 However, such engineered host cells themselves may be used in 
situations where it is important not only to retain the 
structural and functional characteristics of the NHP, but to 
assess biological activity, e.g., in drug screening assays. 

The expression systems that may be used for purposes of 

10 the invention include but are not limited to microorganisms 

such as bacteria {e.g., E. coli, B. subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NHP nucleotide sequences; yeast 
(e.g., Saccharomyces , Pichia) transformed with recombinant 

15 yeast expression vectors containing NHP encoding nucleotide 

sequences; insect -cell systems infected with recombinant virus 
expression vectors (e.g., baculovirus) containing NHP 
sequences; plant cell systems infected with recombinant virus 
expression vectors (e.g., cauliflower mosaic virus, CaMV; 

20 tobacco mosaic virus, TMV) or transformed with recombinant 

plasmid expression vectors (e.g., Ti plasmid) containing NHP 
nucleotide sequences; or mammalian cell systems ( e.g., COS, 
CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing promoters derived from the genome of 

25 mammalian cells (e.g., metallothionein promoter) or from 
mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7 . 5K promoter). 

In bacterial systems, a number of expression vectors may 
be advantageously selected depending upon the use intended for 

3 0 the NHP product being expressed. For example, when a large 
quantity of such a protein is to be produced for the 
generation of pharmaceutical compositions of or containing 
NHP, or for raising antibodies to a NHP , vectors that direct 
the expression of high levels of fusion protein products that 

35 are readily purified may be desirable. Such vectors include, 
but are not limited, to the E. coli expression vector pUR278 
(Ruther et al . , 1983, EMBO J. 2:1791), in which a NHP coding 
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sequence may be ligated individually into the vector in frame 
with the lacZ coding region so that a fusion protein is 
produced; pIN vectors (Inouye & Inouye, 1985, Nucleic Acids 
Res. 23:3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem. 
5 264: 5503-5509) ; and the like. pGEX vectors (Pharmacia or 

American Type Culture Collection) can also be used to express 
foreign polypeptides as fusion proteins with glutathione 
S- transferase (GST) . In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by 

10 adsorption to glutathione-agarose beads followed by elution in 
the presence of free glutathione. The PGEX vectors are 
designed to include thrombin or factor Xa protease cleavage 
sites so that the cloned target gene product can be released 
from the GST moiety. 

15 In an insect system, Autographa californica nuclear 

polyhidrosis virus (AcNPV) is used as a vector to express 
foreign genes. The virus grows in Spodoptera frugiperda 
cells. A NHP coding sequence can be cloned individually into 
non-essential regions (for example the polyhedrin gene) of the 

2 0 virus and placed under control of an AcNPV promoter (for 

example the polyhedrin promoter) . Successful insertion of NHP 
coding sequence will result in inactivation of the polyhedrin 
gene and production of non-occluded recombinant virus (i.e., 
virus lacking the proteinaceous coat coded for by the 

25 polyhedrin gene) . These recombinant viruses are then used to 
infect Spodoptera frugiperda cells in which the inserted 
sequence is expressed (e.g., see Smith et al . , 1983, J. 
Virol. 46: 584; Smith, U.S. Patent No . 4,215,051). 

In mammalian host cells, a number of viral-based 

30 expression systems may be utilized. In cases where an 

adenovirus is used as an expression vector, the NHP nucleotide 
sequence of interest may be ligated to an adenovirus 
transcription/ translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric 

35 sequence may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential 
region of the viral genome (e.g., region El or E3 ) will result 
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in a recombinant virus that is viable and capable of 
expressing a NHP product in infected hosts (e.g., See Logan & 
Shenk, 1984, Proc . Natl. Acad. Sci . USA 51:3 655-3 659). 
Specific initiation signals may also be required for efficient 
5 translation of inserted NHP nucleotide sequences. These 
signals include the ATG initiation codon and adjacent 
sequences. In cases where an entire NHP gene or cDNA, 
including its own initiation codon and adjacent sequences, is 
inserted into the appropriate expression vector, no additional 

10 translational control signals may be needed. However, in 
cases where only a portion of a NHP coding sequence is 
inserted, exogenous translational control signals, including, 
perhaps, the ATG initiation codon, must be provided. 
Furthermore, the initiation codon must be in phase with the 

15 reading frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of 
a variety of origins, both natural and synthetic. The 
efficiency of expression may be enhanced by the inclusion of 

20 appropriate transcription enhancer elements, transcription 
terminators, etc. (See Bittner et al . , 1987, Methods in 
Enzymol. 153:516-544) . 

In addition, a host cell strain may be chosen that 
modulates the expression of the inserted sequences, or 

25 modifies and processes the .gene product in the specific 

fashion desired. Such modifications (e.g./ glycosylation) and 
processing (e.g., cleavage) of protein products may be 
important for the function of the protein. Different host 
cells have characteristic and specific mechanisms for the 

30 post-translational processing and modification of proteins and 
gene products. Appropriate cell lines or host systems can be 
chosen to ensure the correct modification and processing of 
the foreign protein expressed. To this end, eukaryotic host 
cells which possess the cellular machinery for proper 

3 5 processing of the primary transcript, glycosylation, and 
phosphorylation of the gene product may be used. Such 
mammalian host cells include, but are not limited to, CHO, 
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VERO, BHK, HeLa, COS, MDCK , 293, 3T3 , WI3 8, and in particular, 
human cell lines. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
5 lines which stably express the NHP sequences described above 
can be engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 

10 transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in 
an enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers 

15 resistance to the selection and allows cells to stably 

integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines 
which express the NHP product. Such engineered cell lines may 

20 be particularly useful in screening and evaluation of 

compounds that affect the endogenous activity of the NHP 
product . 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
25 (Wigler, et al . , 1977, Cell 11:223), hypoxan thine- guanine 

phosphoribosyltransf erase (Szybalska & Szybalski, 1962, Proc. 
Natl. Acad. Sci . USA 48:2026), and adenine 

phosphoribosyltransferase (Lowy, et al . , 1980, Cell 22:817) 
genes can be employed in tk", hgprt" or aprt cells, 

30 respectively. Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler, et al., 1980, 
Natl. Acad. Sci. USA 77:3567; O'Hare, et al . , 1981, Proc. 
Natl. Acad. Sci. USA 75:1527); gpt, which confers resistance 

35 to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. 
Sci. USA 78:2072); neo , which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al . , 1981, J. Mol. 

20 
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Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al . , 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified 
by utilizing an antibody specific for the fusion protein being 
5 expressed. For example, a system described by Janknecht et 
al . allows for the ready purification of non-denatured fusion 
proteins expressed in human cell lines (Janknecht, et al. , 
1991, Proc. Natl. Acad. Sci. USA 85:8972-897 6). In this 
system, the sequence of interest is subcloned into a vaccinia 

10 recombination plasmid such that the gene's open reading frame 
is translationally fused to an amino- terminal tag consisting 
of six histidine residues. Extracts from cells infected with 
recombinant vaccinia virus are loaded onto Ni 2+ • ni triloace tic 
acid-agarose columns and histidine- tagged proteins are 

15 selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are novel protein 
constructs engineered in such a way that they facilitate 
transport of the NHP to the target site, to the desired organ, 
across the cell membrane and/or to the nucleus where the NHP 

20 can exert its function activity. This goal may be achieved by 
coupling of the NHP to a cytokine or other ligand that would 
direct the NHP to the target organ and facilitate receptor 
mediated transport across the membrane into the cytosol . 
Conjugation of NHPs to antibody molecules or their Fab 

25 fragments could be used to target cells bearing a particular 
epitope. Attaching the appropriate signal sequence to the NHP 
would also transport the NHP to the desired location within 
the cell. Alternatively targeting of NHP or its nucleic acid 
sequence might be achieved using liposome or lipid complex 

30 based delivery systems. Such technologies are described in 

Liposomes: A Practical Approach , New RRC ed. , Oxford University 
Press, New York and in U.S. Patents Nos . 4,594,595, 5,459,127, 
5,948,767 and 6,110,490 and their respective disclosures which 
are herein incorporated by reference in their entirety. 
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5.3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more 
epitopes of a NHP , or epitopes of conserved variants of a NHP, 
5 or peptide fragments of a NHP are also encompassed by the 
invention. Such antibodies include but are not limited to 
polyclonal antibodies, monoclonal antibodies (mAbs) , humanized 
or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab') 2 fragments, fragments produced by a Fab 

10 expression library, anti-idiotypic (anti-Id) antibodies, and 
epitope-binding fragments of any of the above. 

The antibodies of the invention may be used, for example, 
in the detection of NHP in a biological sample and may, 
therefore, be utilized as part of a diagnostic or prognostic 

15 technique whereby patients may be tested for abnormal amounts 
of NHP. Such antibodies may also be utilized in conjunction 
with, for example, compound screening schemes for the 
evaluation of the effect of test compounds on expression 
and/or activity of a NHP gene product. Additionally, such 

20 antibodies can be used in conjunction gene therapy to, for 

example, evaluate the normal and/or engineered NHP- expressing 
cells prior to their introduction into the patient. Such 
antibodies may additionally be used as a method for the 
inhibition of abnormal NHP activity. Thus, such antibodies 

25 may, therefore, be utilized as part of treatment methods. 

For the production of antibodies, various host animals 
may be immunized by injection with the NHP, an NHP peptide 
(e.g., one corresponding the a functional domain of an NHP), 
truncated NHP polypeptides (NHP in which one or more domains 

3 0 have been deleted) , functional equivalents of the NHP or 

mutated variant of the NHP. Such host animals may include but 
are not limited to pigs, rabbits, mice, goats, and rats, to 
name but a few. Various adjuvants may be used to increase the 
immunological response, depending on the host species, 

35 including but not limited to Freund's adjuvant (complete and 
incomplete) , mineral salts such as aluminum hydroxide or 
aluminum phosphate, surface active substances such as 
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lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum. 
Alternatively, the immune response could be enhanced by 
5 combination and or coupling with molecules such as keyhole 
limpet hemocyanin, tetanus toxoid, diptheria toxoid, 
ovalbumin, cholera toxin or fragments thereof. Polyclonal 
antibodies are heterogeneous populations of antibody molecules 
derived from the sera of the immunized animals. 

10 Monoclonal antibodies, which are homogeneous populations 

of antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and 

15 Milstein, (1975, Nature 255:495-497; and U.S. Patent No . 

4,376,110), the human B-cell hybridoma technique (Kosbor et 
al., 1983, Immunology Today 4: 72; Cole et al . , 1983, Proc. 
Natl. Acad. Sci . USA 50:2026-2030), and the EBV-hybridoma 
technique (Cole et al . , 1985, Monoclonal Antibodies And Cancer 

20 Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may 
be of any immunoglobulin class including igG, IgM, IgE, IgA, 
IgD and any subclass thereof. The hybridoma producing the mAb 
of this invention may be cultivated in vitro or in vivo. 
Production of high titers of mAbs in vivo makes this the 

25 presently preferred method .of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al > , 1984, Proc. Natl. 
Acad. Sci., 81:68 51-68 55; Neuberger et al . , 1984, Nature, 
312:604-608; Takeda et al . , 1985, Nature, 314:452-454) by 

3 0 splicing the genes from a mouse antibody molecule of 

appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can 
be used. A chimeric antibody is a molecule in which different 
portions are derived from different animal species, such as 

35 those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. Such technologies are 
described in U.S. Patents Nos. 6,075,181 and 5,877,397 and 
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their respective disclosures which are herein incorporated by 
reference in their entirety. 

Alternatively, techniques described for the production of 
single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 
5 Science 242:423-426; Huston et al . , 1988, Proc . Natl. Acad. 
Sci. USA 85:5879-5883; and Ward et al . , 1989, Nature 334:544- 
546) can be adapted to produce single chain antibodies against 
NHP gene products. Single chain antibodies are formed by 
linking the heavy and light chain fragments of the Fv region 
10 via an amino acid bridge, resulting in a single chain 
polypeptide . 

Antibody fragments which recognize specific epitopes may 
be generated by known techniques. For example, such fragments 
include, but are not limited to: the F(ab') 2 fragments which 

15 can be produced by pepsin digestion of the antibody molecule 
and the Fab fragments which can be generated by reducing the 
disulfide bridges of the F(ab') 2 fragments. Alternatively, Fab 
expression libraries may be constructed (Huse et al . , 1989, 
Science, 246 1 1275-1281 ) to allow rapid and easy identification 

20 of monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 
anti-idiotype antibodies that "mimic" a given NHP, using 
techniques well known to those skilled in the art. (See, 
e.g., Greenspan & Bona, 1993, FASEBJ 7 (5) : 437-444 ; and 

25 Nissinoff, 1991, J. Immunol. 14 7 (8) : 2 42 9-243 8 ) . For example 
antibodies which bind to a NHP domain and competitively 
inhibit the binding of NHP to its cognate receptor can be used 
to generate anti-idiotypes that "mimic" the NHP and, 
therefore, bind and activate or neutralize a receptor. Such 

30 anti-idiotypic antibodies or Fab fragments of such anti- 
idiotypes can be used in therapeutic regimens involving a NHP 

signaling pathway. 

The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 
35 as single illustrations of individual aspects of the 

invention, and functionally equivalent methods and components 
are within the scope of the invention. Indeed, various 
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modifications of the invention, in addition to those shown and 
described herein will become apparent to those skilled in the 
art from the foregoing description. Such modifications are 
intended to fall within the scope of the appended claims. All 
5 cited publications, patents, and patent applications are 
herein incorporated by reference in their entirety. 
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1. An isolated nucleic acid molecule comprising at 
least 24 contiguous bases of nucleotide sequence first 

5 disclosed in the NHP polynucleotide described in SEQ ID NO: 1. 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 
10 NO: 2; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO; 1 or the 
complement thereof. 



15 3 . An isolated nucleic acid molecule encoding the 

amino acid sequence described in SEQ ID NO : 2. 
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SEQUENCE LISTING 

<110> Walke, D. Wade 

Turner, C. Alexander Jr. 
Abuin, Alejandro 
Friedreich, Glenn 
Zambrowicz, Brian 
Sands, Arthur T. 

<12 0> Novel Human Proteases and 

Polynucleotides Encoding the Same 

<130> LEX-0108-PCT 

<150> US 60/171,566 
<151> 1999-12-22 

<160> 7 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 921 
<212> DNA 

<213> Homo sapiens 



<400> 1 

atgagtctca aaatgcttat aagcaggaac aagctgattt tactactagg aatagtcttt 60 

tttgaacrag gtaaatctgc arctctttcg ctccccaaag ctcccagttg tgggcagagt 120 

ctggttaagg tacagccttg gaattatttt aacattttca gtcgcattct tggaggaagc 180 

caagtggaga agggttccta tccctggcag gtatctctga aacaaaggca gaagcatatt 2 40 

tgtggaggaa gcatcgtctc accacagtgg gtgatcacgg cggctcactg cattgcaaac 3 00 

agaaacattg tgtctacttt gaatgttact gctggagagt atgacttaag ccagacagac 3 60 

ccaggagagc aaactctcac tattgaaact gtcatcatac atccacattt ctccaccaag 420 

aaaccaatgg actatgatat tgcccttttg aagatggctg gagccttcca atttggccac 480 

tttgtggggc ccatatgtct tccagagctg cgggagcaat ttgaggctgg ttttatttgt 540 

acaactgcag gctggggccg cttaactgaa ggtggcgtcc tctcacaagt cttgcaggaa 600 

gtgaatctgc ctattttgac ctgggaagag tgtgtggcag ctctgttaac actaaagagg 6 60 

cccatcagtg ggaagacctt tctttgcaca ggttttcctg atggagggag agacgcatgt 720 

cagggagatt caggaggttc actcatgtgc cggaataaga aaggggcctg gactctggct 780 

ggtgtgactt cctggggttt gggctgtggt cgaggctgga gaaacaatgt gaggaaaagt 8 40 

gatcaaggat cccctgggat cttcacagac attagtaaag tgctttcctg gatccacgaa 900 

cacatccaaa ctggtaacta a 921 



<210> 2 

<211> 306 

<212> PRT 

<213> Homo sapiens 



<400> 2 
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16b 


Lys 


Leu 


Pro 


Glu 


Leu 
170 


Arg 


Glu 


Gin 


pne 


Glu 
17 5 


Ala 


Gly 


Phe 


He 


Cys 
180 


Thr 


Thr 


Ala 


Gly 


Trp 
185 


Gly 


Arg 


Leu 


Thr 


Glu 
190 


Gly 


Gly 


Val 


Leu 


Ser 
195 


Gin 


Val 


Leu 


Gin 


Glu 
200 


Val 


Asn 


Leu 


Pro 


He 
205 


Leu 


Thr 


Trp 


Glu 


Glu 
210 


Cys 


Val 


Ala 


Ala 


Leu 
215 


Leu 


Thr 


Leu 


Lys 


Arg 
220 


Pro 


He 


Ser 


Gly 


Lys 


Thr 


Phe 


Leu 


Cys 


Thr 


Gly 


Phe 


Pro 


Asp 


Gly 


Gly 


Arg 


Asp 


Ala 


Cys 


225 










230 










235 










240 


Gin 


Gly 


Asp 


Ser 


Gly 
245 


Gly 


Ser 


Leu 


Met 


Cys 
250 


Arg 


Asn 


Lys 


Lys 


Gly 
255 


Ala 


Trp 


Thr 


Leu 


Ala 
260 


Gly 


Val 


Thr 


Ser 


Trp 
265 


Gly 


Leu 


Gly 


Cys 


Gly 
270 


Arg 


Gly 


Trp 


Arg 


Asn 
275 


Asn 


Val 


Arg 


Lys 


Ser 
280 


Asp 


Gin 


Gly 


Ser 


Pro 
285 


Gly 


He 


Phe 


Thr 


Asp 
290 


He 


Ser 


Lys 


Val 


Leu 
295 


Ser 


Trp 


He 


His 


Glu 
300 


His 


He 


Gin 


Thr 



Gly Asn 
305 



<210> 3 

<211> 909 

<212> DNA 

<2 1 3 > Homo sapiens 



<400> 3 

atgcttataa 

aaatctgcar 

cagccttgga 

ggttcctatc 

atcgtctcac 

tctactttga 

actctcacta 

tatgatattg 

atatgtcttc 

tggggccgct 

attttgacct 

aagacctttc 

ggaggttcac 

tggggtttgg 

cctgggatct 

ggtaactaa 



gcaggaacaa 
ctctttcgct 
attattttaa 
cctggcaggt 
cacagtgggt 
atgttactgc 
ttgaaactgt 
cccttttgaa 
cagagctgcg 
taactgaagg 
gggaagagtg 
tttgcacagg 
tcatgtgccg 
gctgtggtcg 
tcacagacat 



gctgatttta 

ccccaaagct 

cattttcagt 

atctctgaaa 

gatcacggcg 

tggagagtat 

catcacacat 

gatggctgga' 

ggagcaattt 

tggcgtcctc 

tgtggcagct 

ttttcctgat 

gaataagaaa 

aggctggaga 

tagtaaagtg 



ctactaggaa 
cccagttgtg 
cgcattcttg 
caaaggcaga 
gctcactgca 
gacttaagcc 
ccacatttct 
gccttccaat 
gaggctggtt 
tcacaagtct 
ctgttaacac 
ggagggagag 
ggggcctgga 
aacaatgtga 
ctttcctgga 



tagtcttttt 
ggcagagtct 
gaggaagcca 
agcatatttg 
ttgcaaacag 
agacagaccc 
ccaccaagaa 
ttggccactt 
ttatttgtac 
tgcaggaagt 
taaagaggcc 
acgcatgtca 
ctctggctgg 
ggaaaagtga 
tccacgaaca 



tgaacraggt 
ggttaaggta 
agtggagaag 
tggaggaagc 
aaacattgtg 
aggagagcaa 
accaatggac 
tgtggggccc 
aactgcaggc 
gaatc tgcct 
catcagtggg 
gggagattca 
tgtgacttcc 
tcaaggatcc 
catccaaact 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
909 



<210> 4 
<211> 302 
<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Leu He Ser 
1 

Phe Glu Arg Gly 

20 

Cys Gly Gin Ser 



Arg Asn Lys Leu 
5 

Lys Ser Ala Ala 
Leu Val Lys Val 



He Leu Leu Leu 
10 

Leu Ser Leu Pro 
25 

Gin Pro Trp Asn 



Gly He Val Phe 
15 

Lys Ala Pro Ser 
30 

Tyr Phe Asn He 
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35 



Phe 


Ser 
50 


Arg 


lie 


Leu 


Gly 


Trp 


Gin 


Val 


Ser 


Leu 


Lys 


65 










70 


lie 


Val 


Ser 


Pro 


Gin 
85 


Trp 


Arg 


Asn 


lie 


Val 
100 


Ser 


Thr 


Ser 


Gin 


Thr 
115 


Asp 


Pro 


Gly 


lie 


His 
130 


Pro 


His 


Phe 


Ser 


Leu 


Leu 


Lys 


Met 


Ala 


Gly 


145 










150 


lie 


Cys 


Leu 


Pro 


Glu 
165 


Leu 


Thr 


Thr 


Ala 


Glv 
180 


Tro 


Glv 


Val 


Leu 


Gin 
195 


Glu 


Val 


Asn 


Ala 


Ala 
210 


Leu 


Leu 


Thr 


Leu 


Cvs 


Thr 


Gly 


Phe 


Pro 


Asp 


225 










230 


Gly 


Gly 


Ser 


Leu 


Met 
245 


Cys 


Gly 


Val 


Thr 


Ser 
260 


Trp 


Gly 


Val 


Arg 


Lys 
275 


Ser 


Asp 


Gin 


Lys 


Val 
290 


Leu 


Ser 


Trp 


He 



40 



Gly 


Ser 


Gin 


Val 


Glu 


Lvs 


55 










60 


Gin 


Arg 


Gin 


Lys 


His 


He 










75 




Val 


He 


Thr 


Ala 


Ala 


His 








90 






Leu 


Asn 


Val 


Thr 


Ala 


Gly 






105 








Glu 


Gin 


Thr 


Leu 


Thr 


He 




120 










Thr 


Lys 


Lys 


Pro 


Met 


Asp 


135 










140 


Ala 


Phe 


Gin 


Phe 


Gly 


His 










155 




Ara 


Glu 


Gin 


Phe 


Glu 


Ala 








170 






Ara 




Thr 


Glu 


Glv 


Glv 






185 








i_r trr i_i 


Pro 


He 




Thr 


Trn 




200 










Lys 


Ara 


Pro 


He 


Ser 


Glv 


215 










220 


Glv 


Glv 


Arcr 


A^O 


Ala 


Cvs 










235 




Arg 


Asn 


Lys 


Lys 


Gly 


Ala 








250 






Leu 


Gly 


Cys 


Gly 


Arg 


Gly 






265 








Gly 


Ser 


Pro 


Gly 


He 


Phe 




280 










His 


Glu 


His 


He 


Gin 


Thr 


295 










300 



PCT/US00/33738 

45 



Glv 


Ser 


Tvr 


Pro 


Cys 


Gly 


Gly 


Ser 








80 


Cvs 


He 


Ala 


Asn 






95 




Glu 


Tyr 


Asp 


Leu 




110 






Glu 


Thr 


Val 


He 


125 








Tvr 


Asp 


He 


Ala 


Phe 


Val 


Gly 


Pro 








160 


Glv 


IT l I ^ 


Tip 


Cvs 






175 




udl 


Lie U 




Ulil 




-L -7 VJ 






pin 


nl n 




v d J. 


i V J 












php 


T.PU 

1—1 


Gin 


Gly 


Asp 


Ser 








240 


Trp 


Thr 


Leu 


Ala 






255 




Trp 


Arg 


Asn 


Asn 




270 






Thr 


Asp 


He 


Ser 


285 








Gly 


Asn 







<210> 5 

<211> 495 

<212> DNA 

<213> Homo sapiens 



<400> 5 

atggactatg 

gggeccatat 

gcaggctggg 

ctgcctattt 

agtgggaaga 

gattcaggag 

acttcctggg 

ggatcccctg 

caaactggta 



atattgeect 
gtcttccaga 
gccgcttaac 
tgacctggga 
cctttctttg 
gttcactcat 
gtttgggctg 
ggatcttcac 
actaa 



tttgaagatg 
getgegggag 
tgaaggtggc 
agagtgtgtg 
cacaggtttt 
gtgccggaat 
tggtcgaggc 
agacat tagt 



getggagect 
caatttgagg 
gtcctctcac 
gcagctctgt 
cctgatggag 
aagaaagggg 
tggagaaaca 
aaagtgcttt 



tccaatt tgg 
ctggttttat 
aagtcttgea 
taacactaaa 
ggagagaege 
cctggactct 
atgtgaggaa 
cctggatcca 



ccactttgtg 
ttgtacaact 
ggaagtgaat 
gaggcccatc 
atgtcaggga 
ggctggtgtg 
aagtgatcaa 
cgaacacatc 



60 
120 
180 
240 
300 
360 
420 
480 
495 



<210> 6 

<211> 164 

<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Asp Tyr Asp 
1 

Gly His Phe Val 

20 

Glu Ala Gly Phe 
35 

Gly Gly Val Leu 



He Ala Leu Leu 
5 

Gly Pro He Cys 

He Cys Thr Thr 

40 

Ser Gin Val Leu 



Lys Met Ala Gly 
10 

Leu Pro Glu Leu 
25 

Ala Gly Trp Gly 
Gin Glu Val Asn 



Ala Phe Gin Phe 
15 

Arg Glu Gin Phe 
30 

Arg Leu Thr Glu 
45 

Leu Pro He Leu 
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50 










55 










60 










Thr 


Trp 


Glu 


Glu 


Cys 


Val 


Ala 


Ala 


Leu 


Leu 


Thr 


Leu 


Lys 


Arg 


Pro 


He 


65 










70 










75 










80 


Ser 


Gly 


Lys 


Thr 


Phe 


Leu 


Cys 


Thr 


Gly 


Phe 


Pro 


Asp 


Gly 


Gly 


Arg 


Asp 










85 










90 










95 




Ala 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Ser 


Leu 


Met 


Cys 


Arg 


Asn 


Lys 


Lys 








100 










105 










110 






Gly 


Ala 


Trp 


Thr 


Leu 


Ala 


Gly 


Val 


Thr 


Ser 


Trp 


Gly 


Leu 


Gly 


Cys 


Gly 






115 










120 










125 








Arg 


Gly 


Trp 


Arg 


Asn 


Asn 


Val 


Arg 


Lys 


Ser 


Asp 


Gin 


Gly 


Ser 


Pro 


Gly 




130 










135 










140 










lie 


Phe 


Thr 


Asp 


lie 


Ser 


Lys 


Val 


Leu 


Ser 


Trp 


He 


His 


Glu 


His 


He 


145 










150 










155 










160 


Gin 


Thr 


Gly 


Asn 



























<210> 
<211> 
<212> 
<213> 



7 

1568 

DNA 

Homo 



<400> 7 

catacaccat 

ctagaagcta 

ttccatttca 

ggactggctc 

aactatgagt 

ctttt ttgaa 

gagtctggtt 

aagccaagtg 

tatttgtgga 

aaacagaaac 

agacccagga 

caagaaacca 

ccactttgtg 

ttgtacaact 

ggaagtgaat 

gaggcccatc 

atgtcaggga 

ggctggtgtg 

aagtgatcaa 

cgaacacatc 

agggccagag 

tacctctgtt 

agagtaaaat 

aatgaaaaat 

tcaattgatg 

attacaccga 

agttatca 



sapiens 



agtctcagac 
tctaatcatt 
ggtgtcgtga 
aaaggcttcc 
ctcaaaatgc 
craggtaaat 
aaggtacagc 
gagaagggtt 
ggaagcatcg 
attgtgtcta 
gagcaaactc 
atggactatg 
gggcccatat 
gcaggctggg 
ctgcctattt 
agtgggaaga 
gattcaggag 
acttcctggg 
ggatcccctg 
caaactggta 
acagcatcag 
ctcagtgtag 
atggtaaaaa 
gaatccatcc 
atatgctttc 
tggcaccact 



tcagtttcat 

tagtttgtct 

aaagcttgaa 

tatttttgtt 

ttataagcag 

ctgcarctct 

cttggaatta 

cc tatccctg 

tctcaccaca 

ctttgaatgt 

tcactactga 

atattgccct 

gtcttccaga 

gccgcttaac 

tgacctggga 

cctttctttg 

gttcactcat 

gtttgggctg 

ggatcttcac 

actaagccat 

cagagtcctg' 

cacacaagga 

aaaaaaaaaa 

atcctagacc 

accacttact 

tacttctcaa 



gggtgaaatg 
cgtt tttttc 
ttcggcgcgc 
tgctttagtc 
gaacaagc tg 
ttcgctcccc 
ttttaacatt 
gcaggtatct 
gtgggtgatc 
tactgctgga 
aactgtcatc 
tttgaagatg 
gctgcgggag 
tgaaggtggc 
agagtgtgtg 
cacaggtttt 
gtgccggaat 
tggtcgaggc 
agacattagt 
cacacaaggt 
gcaaatcaga 
ttgtgaggtt 
aaaaaaatgt 
ttctcaaact 
tctctggatt 
aaaaatccag 



gagaagatat 
tctggttcaa 
cagatatcac 
tctctaaaat 
attttactac 
aaagctccca 
ttcagtcgca 
ctgaaacaaa 
acggcggctc 
gagtatgact 
atacatccac 
gctggagcct 
caatttgagg 
gtcctctcac 
gcagctctgt 
cctgatggag 
aagaaagggg 
tggagaaaca 
aaagtgcttt 
taagaagctg 
gcacctgaac 
taccaagtct 
ggagcagcat 
ggcctttaat 
cagagtccag 
caaactataa 



tacctcaatc 60 

agtttttttc 120 

acgtgccaag 180 

ttcagggaaa 240 

taggaatagt 300 

gttgtgggca 360 

ttcttggagg 420 

ggcagaagca 480 

actgcattgc 540 

taagccagac 600 

atttctccac 660 

tccaatttgg 720 

ctggttttat 780 

aagtcttgca 840 

taacactaaa 900 

ggagagacgc 9 60 

cctggactct 1020 

atgtgaggaa 1080 

cctggatcca 1140 

ccattctgct 1200 

caacagtctc 1260 

aaataaaaca 13 2 0 

gcagatattc 1380 

tgaaactatc 1440 

agtgctcacc 1500 

ccagatcagt 15 60 

1568 
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